Cue-based dialogue act classification
Author: Nick Webb
Supervisors: Yorick Wilks & Mark Hepple
Abstract
In this thesis, we will address three research questions relating to the discovery and use of cue phrases for Dialogue Act classification. Cue phrases are single words, or combinations of words in phrases, that can serve as reliable indicators of some discourse function. In our case, we are looking to automatically discover cue phrases in corpora that are useful for the detection of Dialogue Acts (DAs). Dialogue Acts are labels attached to utterances in dialogue that serve to concisely characterise a speaker’s intention in producing a particular utterance, a notion that most major theories of dialogue take as central. Our first research question is whether or not we can extract cue phrases automatically from a corpus. We apply a method of cue extraction to the Switchboard corpus of annotated human-human dialogues, and experiment with thresholds to identify cue phrases. To determine if these automatically extracted cue phrases are reliable indicators of DAs, we created a novel, cue-based DA classification model. In this model, our cue phrases are exploited directly, by determining if they appear in unseen dialogue utterances. This forms our second research question, and we extensively explore...
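The extract-then-apply idea in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the scoring rule (the share of an n-gram's occurrences that carry its most frequent label), the threshold names `min_count` and `min_score`, their default values, and the tie-breaking by n-gram length are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def ngrams(tokens, max_n=3):
    """Yield every word n-gram of length 1..max_n as a tuple."""
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            yield tuple(tokens[i:i + n])


def extract_cues(labelled, min_count=2, min_score=0.6, max_n=3):
    """Return {ngram: (label, score)} for n-grams that pass both thresholds.

    `labelled` is an iterable of (utterance_text, dialogue_act_label) pairs.
    The score is an assumed measure: the fraction of utterances containing
    the n-gram that carry its most frequent label.
    """
    totals = Counter()
    by_label = defaultdict(Counter)
    for text, label in labelled:
        # Count each n-gram once per utterance.
        for gram in set(ngrams(text.lower().split(), max_n)):
            totals[gram] += 1
            by_label[gram][label] += 1

    cues = {}
    for gram, total in totals.items():
        label, hits = by_label[gram].most_common(1)[0]
        score = hits / total
        if total >= min_count and score >= min_score:
            cues[gram] = (label, score)
    return cues


def classify(text, cues, default_label, max_n=3):
    """Label an unseen utterance with the label of its best-scoring cue.

    Ties on score go to the longer n-gram (an assumption); utterances
    that contain no cue receive `default_label`.
    """
    best_label, best_key = default_label, (0.0, 0)
    for gram in ngrams(text.lower().split(), max_n):
        if gram in cues:
            label, score = cues[gram]
            if (score, len(gram)) > best_key:
                best_label, best_key = label, (score, len(gram))
    return best_label
```

Raising `min_count` or `min_score` keeps fewer but more selective cues, which is the kind of trade-off that experimenting with thresholds would explore.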
Similar resources
Dialogue Act Recognition using Cue Phrases
Dialogue acts play an important role in modelling discourse phenomena in several components of modern dialogue systems. Many different features have so far been proposed for dialogue act recognition. In this report, we take a cue-based approach, and use word n-grams in dialogue utterances as cue phrases. In our experiment with the Switchboard corpus, we obtained 57.1% classification accurac...
Automatic Extraction of Cue Phrases for Cross-Corpus Dialogue Act Classification
In this paper, we present an investigation into the use of cue phrases as a basis for dialogue act classification. We define what we mean by cue phrases, and describe how we extract them from a manually labelled corpus of dialogue. We describe one method of evaluating the usefulness of such cue phrases, by applying them directly as a classifier to unseen utterances. Once we have extracted cue p...
Investigating the Portability of Corpus-Derived Cue Phrases for Dialogue Act Classification
We present recent work in the area of Cross-Domain Dialogue Act tagging. Our experiments investigate the use of a simple dialogue act classifier based on purely intra-utterance features principally involving word n-gram cue phrases. We apply automatically extracted cues from one corpus to a new annotated data set, to determine the portability and generality of the cues we learn. We show that ou...
Error Analysis of Dialogue Act Classification
We are interested in the area of Dialogue Act (DA) tagging. Identifying the dialogue acts of utterances is recognised as an important step towards understanding the content and nature of what speakers say. We have built a simple dialogue act classifier based on purely intra-utterance features, principally word n-gram cue phrases. Although such a classifier performs surprisingly well, rivalling ...
Empirical determination of thresholds for optimal dialogue act classification
We present recent experiments which build on our work in the area of Dialogue Act (DA) tagging. Identifying the dialogue acts of utterances is recognised as an important step towards understanding the content and nature of what speakers say. We describe a simple dialogue act classifier based on purely intra-utterance features, principally word n-gram cue phrases. Such a classifier performs sur...